Robot Learning With Crash Constraints

نویسندگان

چکیده

In the past decade, numerous machine learning algorithms have been shown to successfully learn optimal policies control real robotic systems. However, it is common encounter failing behaviors as loop progresses. Specifically, in robot applications where undesired but not catastrophic, many struggle with leveraging data obtained from failures. This usually caused by (i) failed experiment ending prematurely, or (ii) acquired being scarce corrupted. Both complicate design of proper reward functions penalize this letter, we propose a framework that addresses those issues. We consider violate constraint and address problem crash constraints, no upon violation. The no-data case addressed novel GP model (GPCR) for combines discrete events (failure/success) continuous observations (only success). demonstrate effectiveness our on simulated benchmarks jumping quadruped, threshold unknown priori. Experimental collected, means constrained Bayesian optimization, directly robot. Our results outperform manual tuning GPCR proves useful estimating threshold.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robot Calibration with Planar Constraints

This paper investigates issues related to robot calibration with planar constraints and, in particular observability conditions for the parameters of the kinematic model of the robot. Both singleand multiple-plane constraints for robot calibration are considered. It is shown that a single-plane constraint is n o d y insdlicient for calibrating a robot. It is also shown that by using a three-pla...

متن کامل

Towards robot incremental learning constraints from comparative demonstration

This paper presents an attempt on incremental robot learning from demonstration. Based on previously learnt knowledge about a task in simpler situations, a robot learns to fulfill the same task properly in a more complicated situation through analyzing comparative demonstrations and extracting new knowledge, especially the constraints that the task in the new situation imposes on the robot’s be...

متن کامل

Learning Robot Skills Through Motion Segmentation and Constraints Extraction

Learning skills while interacting with human users allows robots to work in human environments and become efficient when performing everyday tasks. This paper presents a method of analyzing data obtained from human demonstrations of a task in order to: (a) extract specific constraints for each part of the task (important variables, the frame of reference to be used, and the most suitable contro...

متن کامل

Learning with Violable Constraints

This chapter reviews major results as well as areas of ongoing research on computational models of learning with violable constraints, in the sense of Optimality Theory. The review encompasses learning in classic OT as well as learning in related frameworks that formalize constraint interaction as (probabilistic) weighting or ranking. The learning problem is decomposed into several subproblems,...

متن کامل

Online Learning with Constraints

We study online learning where the objective of the decision maker is to maximize her average long-term reward given that some average constraints are satisfied along the sample path. We define the reward-in-hindsight as the highest reward the decision maker could have achieved, while satisfying the constraints, had she known Nature’s choices in advance. We show that in general the reward-in-hi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE robotics and automation letters

سال: 2021

ISSN: ['2377-3766']

DOI: https://doi.org/10.1109/lra.2021.3057055